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Acceptance of Morbidity Statistics. In 1971, Sterling published an article 
in the Journal of the American Statistical Association (66: 251-257), analyzing 
the statistics used in the earlier documents. The article is reproduced in 
its entirety. 


A Review of the Claim that Excess AAorbidity and 
sterling* Disability Can Be Ascribed' to Smoking 


THEODOR D. STERLING* 
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1. INTRODUCTION 

In 19G7 the U. S. Department of Health* Education* 
l and Welfare published ift report entitled Cigarette Smoking 
' and Health Characteristics which offers 

•* . .statistics .. . on selected health chs net eristics and cigar- 
cite sacking habits of the civil is a, ; ; on:::5;i:utioaal papula* 

. lion 17 years and over in vne United Stales during the period 
. July 190-Wune I9C3 [10,.title page). ; 

The report is careful not to claim that it establishes a 
relationship between smoking and chronic and acute ill¬ 
nesses and disabilities. However, it has become the basis 
* for a number of claims that smoking causes a large variety 
of illnesses and disabilities. These claims are summarized 
..in an official report by 1IEW which makes specific claims 
about the total days Americans arc supposed to have lost 
from work or spent at home “because cigarette smokers 
have highor disability rales than non-smokers” [22]. 
These figures are large—77 million dhys lost from work, 
SS million' days spent in bed, COG million days of re¬ 
stricted'activity. While none of these figures even appear 
in the original study (they are estimates said to be based 
on figures contained in that study although the estimating, 
procedures arc nowhere published), the same figures are 
ilUo used widely and continuouslyin pamphlets circulated! 
by the National Clearinghouse for Smoking and Health 
and are repeated in a recent information publication 
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prepared by a private contractor ($]. Vet. the data pro¬ 
vided by the National Center are extremely ambiguous. 
(With clever manipulation, they could be made to sup¬ 
port the contention that smoking is even beneficial to 
heal t hi) 

Specifically, the study on cigarette smoking and mor¬ 
bidity suffers from two major flaws: 

1. The data actually presented in Cigarette Smoking and 
Health Characteristics may not. in fact, warrant the 
’conclusion that cigarette smoking leads to increases 
in disease and disabilities: 

a. Female smokers, taken as a group, by and large 
report fewer diseases and disabilities and their 
prevalence rates, computed on the basis of these 
reports, are generally smaller than those lor non- 
smokers. 

b. Moderate smokers (those who consume less than> 
a half or less than a full pack a day)i very often 
report the fewest number of diseases and have 
the lowest prevalence rates for both men and' 
women. If the conclusions of the report hold, 
they do so for heavier smokers only who com¬ 
prise considerably less than half the sample of 
smokers. 

2. Inadequate attention has been paid to the potential 
influences of various aspects of the survey methods 
on the findings. Specifically, there appear to be a 
number of obvious sources of bias and error for 
whichiadjustments were not made. 

The sampling plan of the National Health Survey 
follows a multi-stage design which permits a continuous 
sampling of the civilian population of the United Slates. 
Tlio first stage of this design consists of drawing a sample 
of 337 from about 1,000 geographically-defined Primary 
Sampling Units into which the United States has been 
divided! The PSU is a county, a group of contiguous 
counties, or a standard metropolitan! statistical area. 
Within rSUis,,ultimate stage units, called segment?, are 
defined in such ;i manner that each segment contains an 
expected 000 households. Iktch week a sample ufiabout 00 
segments is drawn. Household members in the approx:- 
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Excess Morbidity, Disability, and Smoking 

It is thus not sufficient to report an increase in disease 
among heavy smoker.*: without clarifying if smoking in 
or*yamount results in an increase in disease. 

^ What is moderate" or "excessive" smoking is difficult 
"‘o define. \Yc have taken a reported smoking rate of one 
half pack or less as moderate but include also the group 
reporting a smoking rato of one half to a full pack a dav. 
In so dbing, we clearly deal with the majority of smokers, 
so that observations on them ought to receive proper 
Attention. After all, no fewer than 72.0 percent of the 
males and SS.5 percent of the female smokers reportedi 
that they smoked a pack or loss a day presently. Although 
these percentages decline somewhat if smokers are 
grouped by heaviest amount smoked daily, those who 
smoke less than one pack a day make up the overwhelm- 
ing majority of the smoking population. (Relevant figures 
are summarized in [lG, Table 3; pp. 27—S j.) 

The report stresses that rates of most diseases and 
disabilities increase with increases in the number of 
cigarettes smoked. In a way, this is correct. There is this 
parallel increase in disease and smoking. The trouble is 
that, as often as not, non-smokers report disease rates 
that arc greater than those reported by smokers who con¬ 
sume less than a 1 half or full pack a 1 day. Table 2 com¬ 
pares observed prevalence rates and prevalence rates 
adjusted for peculiarities and difference in age distribu¬ 
tion between moderate smokers and non-smokers. 
^Moderate smokers, whether male or female, tend to rc- 
Jport less disease than dt> non-smokers. Similar observa- 

Tobte ?. ESmUTED UNADJUSTED AND AGE-ADJUSTED 

prevalence rates for different chronic conditions 
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tions arc trim for reports of disabling confinements. The 
number of disease categories for which moderate smokers 
report a smaller prevalence differ for adjusted and urn 
adjusted prevalence rates. Of those who smoke one half 
pack or less, males reported fewer diseases in six out of 
seven disease categories (adjusted or unadjusted) arid 
females reported fewer diseases in six out of eight. For 
smokers who consume between a half and a full' pack a 
day, males reported fewer diseases in six out of eight 
disease categories and females in five out of eight (for 
unadjusted)' and three out of eight (for adjusted prev¬ 
alence rates). 

Of course, the non-smoker category* may include a cer¬ 
tain number of persons who do not smoke because of 
chronic illness. Yet, there is no substantive evidence 
offered that this is so or that moderate smoking is not 
prevalent- among individuals who are ili from a number of 
causos. On the other hand, we cannot explain the higher 
prevalence of morbidity among non-smokers because a 
certain number of tlic very ill smokers may have been 
urged or forced to join the non-smokers. We purposely 
selected persons who never smoked to compare with' 
moderate smokers. In short, while the peculiarities in the 
relationship between the amount smoked and the number 
of diseases reported make it difficult in the extreme to 
explore and explain these data, it is not easy to see what 
purpose could be served by hiding this pertinent result bv 
a blanket statfcmcut that the increase in disease appears 
to be related to the amountsraokcdl 

3. WHAT WERE THE POTENTIAL INFLUENCES OF THE SURVEY 
METHODS ON THE FINDINGS? 

The National Center for Health Statistics has been in. 
the forefront of developing and investigating probability 
sampling methods, attendant survey methodology, and 
sources-of error and ways to adjust for them. For this 
reason, it is not easy to understand why so many poten¬ 
tial sources of bias were simply ignoredi This is doubly 
surprising because the basic methodological investigations 
that have shown only a small correlation between the 
incidonce of rncdically-documcnted disabilities and the 
estimated number of these same diseases derived from 
questionnaire interview methods corr.c from t\vo widely- 
known and wcll-respcctcd studies sponsored: by and 
published through the NCI13; These detailed studies had 
shown that: proxy responses tend to suffer from errors 
roughly twice the size of those which result frorm self 
respondents [13,. l-l]. One of these studies is especially 
relevant because it demonstrated that "psycholbgical^ or 
•sociological" 1 factors have profound c/Tects on the size of 
errors in. responses of proxies, much more so than on 
responses that came from self respondents [u]. 

An interview technique which obtains data about a 
larger population by intfcrvicwing those members of a 
household! wlio are fortuitously present when the inter¬ 
viewer calls was, in fact, recognized during the inception 
of the National Health Survey as a serious source of error 
ami the budgetary limitationsforcing this procedure upou 
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the survey properly lamented [ll, 12]; Yet, national 
estimates on the cfTcct of smoking are based on these 
same data, with the fond hope that errors inthem are 
randomly distributed throughout all categories of in¬ 
formation so that estimates of different categories (such 
as for smokers and non-smokers) arc not affected differ¬ 
entially. 1 Given.the public nature of debate on smoking 
and health in. the press and on television, there is surely a 
good possibility that proxy respondents may be affeoted 
by opinions and beliefs concerning smoking and, accord- 
ingly, color their answers tb interviewers. Similar prob¬ 
lems exist for such obvious potential sources of bias as 
employment or non-employment or for other “charac¬ 
teristics” of respondents. 

* 3.1 The Smoking-Proxy Bios 

The survey obtained records for 41,743 males. Of these, 
29,322 were reported to be smokers. Only 12,4-10 smoking 
males reported directly about their disease and dis¬ 
abilities. Of the smoking males for whom information was 
obtained by proxy, the largest group had information ob¬ 
tained from their spouse. Counting only wives who either 

* smoked or did not: at the time of the interview, there were 
11,440 such male smokers, comprising approximately 40 
percent of the sample of malt smokers. It may be reason¬ 
able to suppose that among non-smoking, informants 
(mostly wives) there are those who do not smoke because 
of the belief that smoking could be harmful. By com¬ 
paring the number of diseases reported for smokers by 
informants who do smoke and informants-who-do not 
smoke, we can isolate to some extent the effect of the 
belief (that may be held cousciously or not) thatismoking 
5s harmful in terms of the number of diseases each group 
reports for their smoking husband. 

The 11,440 male smokers were divided into the 5.G7S 
proxies whose wives had never smoked and the 5,5G2 
proxies whose wives smoked at the time of: interview. 
The percentage of smoking males for whom one or more 
chtonic conditions were reported by the wife, the per¬ 
centage for whom different kinds of chronic conditions 
were reported, and the average number of chronic condi¬ 
tions reported for eachi proxy for whom diseases were 
reported arc shown in Table 3. As can be seen, non¬ 
smoking wives uniformly reported more disease lor their 
smoking husbands than did the wives who themselves did 
smoke. But not onty were more individuals reported ns 
having disease by non+smoking than by, smoking wives,; 
but the number of diseases reported for each husband was 
also affected by whether or not the wife smoked. 

The comparison of the number of diseases reported by 
smoking and noinsmoking informants is of special signi¬ 
ficance for the category, of bronchitis/emphysema. Since 
smokers oftcu cough, the question may be raised to what 
extent this cough is perceived by non-smokers, as a sign 
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Table 3. PERCENT OF PROXY SMOKING MALES FOR WHOM 
CHRONIC CONDITIONS, RESTRICTED ACTIVITY, CONFINE¬ 
MENT TO BED, AND LOST DAYS OF WORK V/ERE REPORTED 
BY SMOKING AND NON-SMOKING WIVES* 
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of disease while it may be largely ignored by smokers as a 
simple response to irritation. Certainly it would: be un¬ 
reasonable to suppose that the wife's failure to smoke 
causes an increase in the number of instances of husbands 
with bronchitis and emphysema (or of any other chronic 
disease). 

Very much the same picture emerges when we compare 
the number of restricted activity days, days spent in bed,, 
and days lost from work and the average number of such 
lost days reported by wives who smoke with the number 
reported by wives who do not smoke. The percentage of 
males for whom it: is reported that they have lost days is 
uniformly smaller if the wife reporting these disabilities 
docs smoke than if the wife docs not. The same is true for 
average number of days reported with the exception of 
work-loss days where smoking wives report more work 
days lost than nonsmoking wives. _ 

Since the difference in the raw prevalence of reported 
diseases and: disabilities as described in [l6] is often no 
larger and sometimes even smaller than the prevalence in 
reported disease and disabilities between smoking; and 
nomsmoking proxy informants, it is not impossible that 
the observed difference in prevalence between smokers 
and non-smokers may have been due to these psycho¬ 
logical factors. In any case, it seems a pity that the proper 
adjustment for this factor was not included liii tile analysis 
cflhoNCllS data. * 
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Excess Morbidity, Disability, end Smoking 
3.2 The Adequacy of Age Adjustment Methods 

C There are several methods of comparing two sets of 
,tcs when large confounding effects may diiTcrcntiallv 
distort them. One of the obvious confounding effects in 
' comparing disease rates is age. Disease rates are obviously 
“ related to age, with the highest rat os occurring among the 
very' young and the aged. If the age distribution for 
sfnokcrs and non-smokers were similar^ then rates of dis¬ 
ease among them could be compared directly. However, 
the age distributions of sampled smokers and non-smokers 
were quite different. Thus, observed differences in disease 
rates between smokers and non-smokers may be related to 
differences in their age distributions rather than to the 
fact that they differ in smoking habits. The data'im the 
report Cigarette Smoking ana Health Characteristics were 
adjusted, therefore, for peculiarities of their age cliitribu- 
iion.There arc, however, two objections to the analyses in 
the report. 

Adjustments do not go far enough. There are a number 
of differences between smokers and non-smokers other 
than age and sex that need to be considered. Observed 
differences in rates between smokers and non-smokers 
may be related to differences between the two groups on 
such factors as employment status, directness of inter¬ 
view, characteristics of proxy informants, and so om 
Secondly, the indirect method of age adjustment used in 

C Sc report is most easily affected by extraneous factors 
nd ought not to have been used. 

The figure gives some idea of the differences between 
male smokers and non-smokers on employment status 
and directness of interview by age. (The distributions are 
as markedly different for females, but not shown here.) 
Although we cannot simultaneously adjust for all these 
factors freyn the data at hand; some indication of the 
effect of the employed-unemployed!dimensionion reports 
of diseases and disabilities may be inferred from Table 4. 
The effect of differences in age distributions by employ¬ 
ment status or directness of interview on number of re¬ 
ported diseases may well mask a much more: profound 
cfTcct due to smoking or,,quite to the contrary, cause such 
an effect to appear when none actually exists. Consider¬ 
ing the profound.differences in age distribution existing 
between employed and unemployed members of the 
sample or between direct and proxy, respondents, it is 
difficult 10 judge what indirect age adjustment actually 
accomplished. Jt would have been more appropriate under 
the circumstances to use a direct age adjustment method! 
or compare agc-specific. disability rates. (All other reports 
on the same data published by XCI?.S do report more 
detailed nge-speciMc rates;or use a direct method of age 
^adjustment that reflects agc-spccific rates more directly, 
the indirect.method of age. adjustment used only in 
tliis report and no other. This npurt does pre-ent age- 
spcciHc data, but fur large age ranges as 17 to-1 I ti -bj totil, 
and over (TV years, which is: really not of much help.) 
Detailed age-speeine rates then should 1 have dvim shown 
and adjusted for iiiflurticcs on n>poH"CS due to employ¬ 
ment and proxy status at the very least. 
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AGE D/S7K/BUT/OM OF VALES—BY” EMPLOYMENT STATUS 
AND BY DIRECTNESS OF INJERVIEY/' 
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3.3 How Many Smokers Are There? 

The prevalence of: smoking in the XHS sample is of 
prime importance since, after all, smoking is the inde¬ 
pendent variable inithe comparison. Vet, nowhere do we 
find an attempt to assess the accuracy with which num¬ 
bers of smokers or non-smokers were obtained. There are 
two well-known studies available that make comparable 
observations of the distribution of smoking in the U. S. 
population. It is interesting that both differ quite em¬ 
phatically from the age distribution of smokers and non- 
smokers obtained from, the household interview [5*. 7). 
These differences are Mimmarized in Table 5 j Again, the 
difference is mucin larger and much' more pronounced 
among males than iVmnliv*. nm.-i likely because there were 
many more proxies among iln* former. 

These datev necth not indicate that the XCIIS survey 
wan- plagued hy I'xropiimialty large errors in counting 
smokers and non-Mnok-ws. After all, the error may have 
been in Hammond's or Ifean^zel's studies. However, it is 
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- r«We 4. PERCENT OF SAMPLE POPULATION EXPERIENCING DISABILITIES , BV EMPLOYED AND UNEMPLOYED' 
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clcar tint even for such a simple question as whether an 
individtial has ever smoked, large errors appear if a data 
gathering procedure relies heavily on proxies. Again, it 
would have been important for the credibility of the 
study if more attention had been given to the accuracy of 
smoking histories and discrepancies between the survey’s 
results and that of other studies examined. 


4. CONCLUSIONS 

Surely it is important to know to what extent smoking 
harms the population of this country. It may be even that 
given existing budget limitations, better methods of 
inquiry thani a household sampling survey may not be 
available. Yet, it hardly helps to clarify this problem if 
data resulting from methods that are known to produce 
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Table 5. COMPARISON Of NON-SMOKERS' AGE DISTRIBUTION* 
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Excess Morbidity, Disability, and Smoking 

large errors and are possible 'sources of bias are subjected 
to inadequate and perhaps inappropriate analyses and not 
^N>ugh attention is paid to the potential; influences of 
^.rious aspects of the-survey'methods on these findings. 
Perhaps if any lesson is to be learned from this issue, it is 
not that smolung causes untold diseases *and disabilities 
. "and loss from work, but that claims, about such con¬ 
troversial topics need to be carefully and intelligently 
reviewed, even if they arc made by public agencies or by 
other generally reliable sources. 
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